Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 20 |
| Number of observations | 10874 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.7 MiB |
| Average record size in memory | 160.0 B |
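The memory figures above are mutually consistent if every cell occupies 8 bytes. That cell size is an assumption made here for illustration; the report does not state it. A minimal sketch of the arithmetic:

```python
# Figures copied from the Dataset statistics table.
n_variables = 20
n_observations = 10874

# Assumption (not stated in the report): every cell is stored in 8 bytes.
BYTES_PER_CELL = 8

record_bytes = n_variables * BYTES_PER_CELL            # bytes per row
total_mib = record_bytes * n_observations / 2**20      # whole table, in MiB
column_kib = BYTES_PER_CELL * n_observations / 2**10   # one column, in KiB

print(f"{record_bytes:.1f} B per record")
print(f"{total_mib:.1f} MiB in total")
print(f"{column_kib:.1f} KiB per column")
```

Under the same assumption, the per-column figure agrees with the 85.0 KiB memory size listed for each variable.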
Variable types
| Type | Count |
|---|---|
| NUM | 14 |
| CAT | 5 |
| BOOL | 1 |
Reproduction
| Item | Value |
|---|---|
| Analysis started | 2021-11-24 10:05:04.156364 |
| Analysis finished | 2021-11-24 10:05:23.879777 |
| Duration | 19.72 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
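The reported duration can be recovered from the two timestamps. A small sketch using only the values in the table above (the format string is an assumption about how the timestamps are laid out):

```python
from datetime import datetime

# Assumed layout of the timestamps shown in the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2021-11-24 10:05:04.156364", FMT)
finished = datetime.strptime("2021-11-24 10:05:23.879777", FMT)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```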
Warnings
| Warning | Type |
|---|---|
| Avg_Open_To_Buy is highly correlated with Credit_Limit | High correlation |
| Credit_Limit is highly correlated with Avg_Open_To_Buy | High correlation |
| Dependent_count has 920 (8.5%) zeros | Zeros |
| Contacts_Count_12_mon has 277 (2.5%) zeros | Zeros |
| Total_Revolving_Bal has 3875 (35.6%) zeros | Zeros |
| Avg_Utilization_Ratio has 3872 (35.6%) zeros | Zeros |
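The rows in the First rows table suggest why these warnings come in pairs. In that sample, Avg_Open_To_Buy equals Credit_Limit minus Total_Revolving_Bal, and Avg_Utilization_Ratio equals Total_Revolving_Bal divided by Credit_Limit, to three decimals. This is an observation on the sample, not a definition given by the report. A minimal sketch that checks both relations on four of those rows:

```python
# (Credit_Limit, Total_Revolving_Bal, Avg_Open_To_Buy, Avg_Utilization_Ratio)
# copied from rows 0-3 of the First rows table.
rows = [
    (5546.0, 1829, 3717.0, 0.330),
    (9129.0, 0, 9129.0, 0.000),
    (2636.0, 1619, 1017.0, 0.614),
    (3001.0, 2055, 946.0, 0.685),
]

checks = []
for credit_limit, revolving, open_to_buy, utilization in rows:
    # Hypothesised relations, inferred from the sample rows only.
    open_ok = credit_limit - revolving == open_to_buy
    # The table shows three decimals, so allow half a unit in the last place.
    util_ok = abs(revolving / credit_limit - utilization) < 0.0005
    checks.append((open_ok, util_ok))

print(checks)
```

If these relations hold more generally, a zero revolving balance implies a zero utilization ratio, which would be consistent with the near-identical zero counts (3875 and 3872) in the warnings.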
Customer_Age
Real number (ℝ≥0)
| Distinct count | 44 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.22484826190914 |
|---|---|
| Minimum | 26 |
| Maximum | 73 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 26 |
|---|---|
| 5-th percentile | 34 |
| Q1 | 41 |
| median | 46 |
| Q3 | 51 |
| 95-th percentile | 58 |
| Maximum | 73 |
| Range | 47 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 7.373765181 |
|---|---|
| Coefficient of variation (CV) | 0.1595195108 |
| Kurtosis | -0.09047834281 |
| Mean | 46.22484826 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.03106652988 |
| Sum | 502649 |
| Variance | 54.37241295 |
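Several of the statistics above are linked by simple identities: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, and the coefficient of variation is the standard deviation divided by the mean. A short sketch checking these identities against the Customer_Age figures (the tolerance is an arbitrary choice that allows for the rounding in the printed values):

```python
import math

# Figures copied from the Customer_Age tables.
n = 10874
total = 502649
mean = 46.22484826
std = 7.373765181
variance = 54.37241295
cv = 0.1595195108
minimum, maximum = 26, 73
q1, q3 = 41, 51

TOL = 1e-7  # relative tolerance for values printed to about ten digits

checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=TOL),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=TOL),
    "CV = std / mean": math.isclose(std / mean, cv, rel_tol=TOL),
    "range = max - min": maximum - minimum == 47,
    "IQR = Q3 - Q1": q3 - q1 == 10,
}

for name, ok in checks.items():
    print(f"{name}: {ok}")
```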
| Value | Count | Frequency (%) | |
| 46 | 616 | 5.7% | |
| 44 | 599 | 5.5% | |
| 48 | 592 | 5.4% | |
| 47 | 577 | 5.3% | |
| 49 | 559 | 5.1% | |
| 45 | 557 | 5.1% | |
| 43 | 540 | 5.0% | |
| 42 | 521 | 4.8% | |
| 50 | 501 | 4.6% | |
| 41 | 485 | 4.5% | |
| Other values (34) | 5327 | 49.0% |
| Value | Count | Frequency (%) | |
| 26 | 60 | 0.6% | |
| 27 | 22 | 0.2% | |
| 28 | 24 | 0.2% | |
| 29 | 39 | 0.4% | |
| 30 | 71 | 0.7% |
| Value | Count | Frequency (%) | |
| 73 | 1 | < 0.1% | |
| 68 | 2 | < 0.1% | |
| 67 | 3 | < 0.1% | |
| 66 | 2 | < 0.1% | |
| 65 | 63 | 0.6% |
Gender
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 85.0 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| F | 6079 | 55.9% |
| M | 4795 | 44.1% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Dependent_count
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.226871436453927 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 920 |
| Zeros (%) | 8.5% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.20642796 |
|---|---|
| Coefficient of variation (CV) | 0.5417591427 |
| Kurtosis | -0.4983526609 |
| Mean | 2.226871436 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.02289801855 |
| Sum | 24215 |
| Variance | 1.455468423 |
| Value | Count | Frequency (%) | |
| 2 | 3323 | 30.6% | |
| 3 | 2992 | 27.5% | |
| 1 | 2081 | 19.1% | |
| 4 | 1278 | 11.8% | |
| 0 | 920 | 8.5% | |
| 5 | 280 | 2.6% |
| Value | Count | Frequency (%) | |
| 0 | 920 | 8.5% | |
| 1 | 2081 | 19.1% | |
| 2 | 3323 | 30.6% | |
| 3 | 2992 | 27.5% | |
| 4 | 1278 | 11.8% |
| Value | Count | Frequency (%) | |
| 5 | 280 | 2.6% | |
| 4 | 1278 | 11.8% | |
| 3 | 2992 | 27.5% | |
| 2 | 3323 | 30.6% | |
| 1 | 2081 | 19.1% |
Education_Level
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 85.0 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| Graduate | 4458 | 41.0% |
| High School | 2244 | 20.6% |
| Doctorate | 1417 | 13.0% |
| Uneducated | 1384 | 12.7% |
| College | 909 | 8.4% |
| Post-Graduate | 462 | 4.2% |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.13279382 |
| Min length | 7 |
Marital_Status
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 85.0 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| Married | 6053 | 55.7% |
| Single | 4208 | 38.7% |
| Divorced | 613 | 5.6% |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.669394887 |
| Min length | 6 |
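The length statistics can be reproduced from the frequency table, assuming "length" means the number of characters in each category label (an assumption; the report does not define it). A minimal sketch:

```python
from statistics import median

# Counts copied from the Marital_Status frequency table.
counts = {"Married": 6053, "Single": 4208, "Divorced": 613}

n = sum(counts.values())

# Assumption: "length" is the character count of the label.
mean_length = sum(len(label) * count for label, count in counts.items()) / n

# Expand to one length per observation so the median can be taken directly.
lengths = [len(label) for label, count in counts.items() for _ in range(count)]
median_length = median(lengths)

print(n, mean_length, median_length, min(lengths), max(lengths))
```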
Income_Category
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 85.0 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| Less than $40K | 4599 | 42.3% |
| $80K - $120K | 2291 | 21.1% |
| $40K - $60K | 1880 | 17.3% |
| $60K - $80K | 1365 | 12.6% |
| $120K + | 739 | 6.8% |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 12.20765128 |
| Min length | 7 |
Card_Category
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 85.0 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| Blue | 10223 | 94.0% |
| Silver | 540 | 5.0% |
| Gold | 101 | 0.9% |
| Platinum | 10 | 0.1% |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.102997977 |
| Min length | 4 |
Months_on_book
Real number (ℝ≥0)
| Distinct count | 44 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.803935994114404 |
|---|---|
| Minimum | 13 |
| Maximum | 56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 32 |
| median | 36 |
| Q3 | 40 |
| 95-th percentile | 48 |
| Maximum | 56 |
| Range | 43 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 7.366607013 |
|---|---|
| Coefficient of variation (CV) | 0.2057485248 |
| Kurtosis | 0.6094624981 |
| Mean | 35.80393599 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.1255156753 |
| Sum | 389332 |
| Variance | 54.26689888 |
| Value | Count | Frequency (%) | |
| 36 | 2235 | 20.6% | |
| 35 | 524 | 4.8% | |
| 34 | 490 | 4.5% | |
| 37 | 463 | 4.3% | |
| 38 | 449 | 4.1% | |
| 33 | 423 | 3.9% | |
| 39 | 423 | 3.9% | |
| 32 | 393 | 3.6% | |
| 40 | 386 | 3.5% | |
| 31 | 381 | 3.5% | |
| Other values (34) | 4707 | 43.3% |
| Value | Count | Frequency (%) | |
| 13 | 49 | 0.5% | |
| 14 | 18 | 0.2% | |
| 15 | 31 | 0.3% | |
| 16 | 23 | 0.2% | |
| 17 | 32 | 0.3% |
| Value | Count | Frequency (%) | |
| 56 | 67 | 0.6% | |
| 55 | 31 | 0.3% | |
| 54 | 34 | 0.3% | |
| 53 | 50 | 0.5% | |
| 52 | 46 | 0.4% |
Total_Relationship_Count
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4221077800257493 |
|---|---|
| Minimum | 1 |
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.505394368 |
|---|---|
| Coefficient of variation (CV) | 0.4399026755 |
| Kurtosis | -0.9491254649 |
| Mean | 3.42210778 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.1281667224 |
| Sum | 37212 |
| Variance | 2.266212204 |
| Value | Count | Frequency (%) | |
| 3 | 2746 | 25.3% | |
| 2 | 2023 | 18.6% | |
| 4 | 1990 | 18.3% | |
| 5 | 1687 | 15.5% | |
| 6 | 1221 | 11.2% | |
| 1 | 1207 | 11.1% |
| Value | Count | Frequency (%) | |
| 1 | 1207 | 11.1% | |
| 2 | 2023 | 18.6% | |
| 3 | 2746 | 25.3% | |
| 4 | 1990 | 18.3% | |
| 5 | 1687 | 15.5% |
| Value | Count | Frequency (%) | |
| 6 | 1221 | 11.2% | |
| 5 | 1687 | 15.5% | |
| 4 | 1990 | 18.3% | |
| 3 | 2746 | 25.3% | |
| 2 | 2023 | 18.6% |
Months_Inactive_12_mon
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.3682177671510023 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 30 |
| Zeros (%) | 0.3% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9086712506 |
|---|---|
| Coefficient of variation (CV) | 0.3836941278 |
| Kurtosis | 1.348401594 |
| Mean | 2.368217767 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.5509799014 |
| Sum | 25752 |
| Variance | 0.8256834416 |
| Value | Count | Frequency (%) | |
| 2 | 4250 | 39.1% | |
| 3 | 4181 | 38.4% | |
| 1 | 1755 | 16.1% | |
| 4 | 413 | 3.8% | |
| 5 | 168 | 1.5% | |
| 6 | 77 | 0.7% | |
| 0 | 30 | 0.3% |
| Value | Count | Frequency (%) | |
| 0 | 30 | 0.3% | |
| 1 | 1755 | 16.1% | |
| 2 | 4250 | 39.1% | |
| 3 | 4181 | 38.4% | |
| 4 | 413 | 3.8% |
| Value | Count | Frequency (%) | |
| 6 | 77 | 0.7% | |
| 5 | 168 | 1.5% | |
| 4 | 413 | 3.8% | |
| 3 | 4181 | 38.4% | |
| 2 | 4250 | 39.1% |
Contacts_Count_12_mon
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.522254919992643 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 277 |
| Zeros (%) | 2.5% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.028031397 |
|---|---|
| Coefficient of variation (CV) | 0.4075842566 |
| Kurtosis | 0.2600013447 |
| Mean | 2.52225492 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.0262617261 |
| Sum | 27427 |
| Variance | 1.056848552 |
| Value | Count | Frequency (%) | |
| 3 | 4050 | 37.2% | |
| 2 | 3648 | 33.5% | |
| 4 | 1323 | 12.2% | |
| 1 | 1308 | 12.0% | |
| 0 | 277 | 2.5% | |
| 5 | 227 | 2.1% | |
| 6 | 41 | 0.4% |
| Value | Count | Frequency (%) | |
| 0 | 277 | 2.5% | |
| 1 | 1308 | 12.0% | |
| 2 | 3648 | 33.5% | |
| 3 | 4050 | 37.2% | |
| 4 | 1323 | 12.2% |
| Value | Count | Frequency (%) | |
| 6 | 41 | 0.4% | |
| 5 | 227 | 2.1% | |
| 4 | 1323 | 12.2% | |
| 3 | 4050 | 37.2% | |
| 2 | 3648 | 33.5% |
Credit_Limit
Real number (ℝ≥0)
| Distinct count | 8288 |
|---|---|
| Unique (%) | 76.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8546.014396974913 |
|---|---|
| Minimum | 1438.3 |
| Maximum | 34516.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 1438.3 |
|---|---|
| 5-th percentile | 1438.3 |
| Q1 | 2413.25 |
| median | 4402 |
| Q3 | 10819.56822 |
| 95-th percentile | 34516 |
| Maximum | 34516 |
| Range | 33077.7 |
| Interquartile range (IQR) | 8406.31822 |
Descriptive statistics
| Standard deviation | 9203.534695 |
|---|---|
| Coefficient of variation (CV) | 1.076938824 |
| Kurtosis | 1.825459053 |
| Mean | 8546.014397 |
| Median Absolute Deviation (MAD) | 2623.327158 |
| Skewness | 1.683041215 |
| Sum | 92929360.55 |
| Variance | 84705050.88 |
| Value | Count | Frequency (%) | |
| 1438.3 | 617 | 5.7% | |
| 34516 | 593 | 5.5% | |
| 9959 | 12 | 0.1% | |
| 15987 | 10 | 0.1% | |
| 23981 | 8 | 0.1% | |
| 2222 | 7 | 0.1% | |
| 2001 | 7 | 0.1% | |
| 2490 | 7 | 0.1% | |
| 2423 | 6 | 0.1% | |
| 6224 | 6 | 0.1% | |
| Other values (8278) | 9601 | 88.3% |
| Value | Count | Frequency (%) | |
| 1438.3 | 617 | 5.7% | |
| 1438.334277 | 1 | < 0.1% | |
| 1438.334381 | 1 | < 0.1% | |
| 1438.530168 | 1 | < 0.1% | |
| 1438.586209 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 34516 | 593 | 5.5% | |
| 34496 | 1 | < 0.1% | |
| 34360.9758 | 1 | < 0.1% | |
| 34283.00937 | 1 | < 0.1% | |
| 34198 | 1 | < 0.1% |
Total_Revolving_Bal
Real number (ℝ≥0)
| Distinct count | 2211 |
|---|---|
| Unique (%) | 20.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 953.8397093985653 |
|---|---|
| Minimum | 0 |
| Maximum | 2517 |
| Zeros | 3875 |
| Zeros (%) | 35.6% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 917.5 |
| Q3 | 1696 |
| 95-th percentile | 2517 |
| Maximum | 2517 |
| Range | 2517 |
| Interquartile range (IQR) | 1696 |
Descriptive statistics
| Standard deviation | 891.0682687 |
|---|---|
| Coefficient of variation (CV) | 0.9341907868 |
| Kurtosis | -1.334852901 |
| Mean | 953.8397094 |
| Median Absolute Deviation (MAD) | 917.5 |
| Skewness | 0.3038857608 |
| Sum | 10372053 |
| Variance | 794002.6594 |
| Value | Count | Frequency (%) | |
| 0 | 3875 | 35.6% | |
| 2517 | 616 | 5.7% | |
| 1650 | 10 | 0.1% | |
| 1664 | 9 | 0.1% | |
| 868 | 9 | 0.1% | |
| 1829 | 9 | 0.1% | |
| 859 | 9 | 0.1% | |
| 1544 | 9 | 0.1% | |
| 2216 | 8 | 0.1% | |
| 1491 | 8 | 0.1% | |
| Other values (2201) | 6312 | 58.0% |
| Value | Count | Frequency (%) | |
| 0 | 3875 | 35.6% | |
| 1 | 2 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2517 | 616 | 5.7% | |
| 2515 | 1 | < 0.1% | |
| 2514 | 7 | 0.1% | |
| 2512 | 4 | < 0.1% | |
| 2511 | 3 | < 0.1% |
Avg_Open_To_Buy
Real number (ℝ≥0)
| Distinct count | 8903 |
|---|---|
| Unique (%) | 81.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7592.090239423786 |
|---|---|
| Minimum | 3.0 |
| Maximum | 34516.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 455.5155101 |
| Q1 | 1438.3 |
| median | 3515 |
| Q3 | 9759.964647 |
| 95-th percentile | 32243.35 |
| Maximum | 34516 |
| Range | 34513 |
| Interquartile range (IQR) | 8321.664647 |
Descriptive statistics
| Standard deviation | 9224.752281 |
|---|---|
| Coefficient of variation (CV) | 1.215047765 |
| Kurtosis | 1.807575191 |
| Mean | 7592.090239 |
| Median Absolute Deviation (MAD) | 2613.35 |
| Skewness | 1.676472785 |
| Sum | 82556389.26 |
| Variance | 85096054.64 |
| Value | Count | Frequency (%) | |
| 1438.3 | 450 | 4.1% | |
| 34516 | 151 | 1.4% | |
| 31999 | 44 | 0.4% | |
| 740 | 6 | 0.1% | |
| 953 | 6 | 0.1% | |
| 1129 | 5 | < 0.1% | |
| 519 | 5 | < 0.1% | |
| 983 | 5 | < 0.1% | |
| 701 | 5 | < 0.1% | |
| 463 | 5 | < 0.1% | |
| Other values (8893) | 10192 | 93.7% |
| Value | Count | Frequency (%) | |
| 3 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 10.71998857 | 1 | < 0.1% | |
| 10.87361879 | 1 | < 0.1% | |
| 12.09100071 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 34516 | 151 | 1.4% | |
| 34504.5681 | 1 | < 0.1% | |
| 34502.11452 | 1 | < 0.1% | |
| 34494.64929 | 1 | < 0.1% | |
| 34494.03572 | 1 | < 0.1% |
Total_Amt_Chng_Q4_Q1
Real number (ℝ≥0)
| Distinct count | 5395 |
|---|---|
| Unique (%) | 49.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.734359804631299 |
|---|---|
| Minimum | 0.0 |
| Maximum | 3.397 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.4301134112 |
| Q1 | 0.6050190724 |
| median | 0.721 |
| Q3 | 0.853 |
| 95-th percentile | 1.041545085 |
| Maximum | 3.397 |
| Range | 3.397 |
| Interquartile range (IQR) | 0.2479809276 |
Descriptive statistics
| Standard deviation | 0.2066949838 |
|---|---|
| Coefficient of variation (CV) | 0.2814628231 |
| Kurtosis | 6.444132701 |
| Mean | 0.7343598046 |
| Median Absolute Deviation (MAD) | 0.1237696319 |
| Skewness | 1.041128455 |
| Sum | 7985.428516 |
| Variance | 0.04272281631 |
| Value | Count | Frequency (%) | |
| 0.791 | 27 | 0.2% | |
| 0.838 | 24 | 0.2% | |
| 0.722 | 23 | 0.2% | |
| 0.718 | 23 | 0.2% | |
| 0.744 | 23 | 0.2% | |
| 0.631 | 23 | 0.2% | |
| 0.715 | 22 | 0.2% | |
| 0.681 | 22 | 0.2% | |
| 0.743 | 22 | 0.2% | |
| 0.742 | 22 | 0.2% | |
| Other values (5385) | 10643 | 97.9% |
| Value | Count | Frequency (%) | |
| 0 | 3 | < 0.1% | |
| 0.009776142186 | 1 | < 0.1% | |
| 0.01 | 1 | < 0.1% | |
| 0.05371510409 | 1 | < 0.1% | |
| 0.061 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 3.397 | 1 | < 0.1% | |
| 2.594 | 1 | < 0.1% | |
| 2.368 | 1 | < 0.1% | |
| 2.275 | 1 | < 0.1% | |
| 2.121 | 1 | < 0.1% |
Total_Trans_Amt
Real number (ℝ≥0)
| Distinct count | 5104 |
|---|---|
| Unique (%) | 46.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3871.287658635277 |
|---|---|
| Minimum | 569 |
| Maximum | 18484 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 569 |
|---|---|
| 5-th percentile | 1131 |
| Q1 | 2015 |
| median | 2668 |
| Q3 | 4586 |
| 95-th percentile | 9325.7 |
| Maximum | 18484 |
| Range | 17915 |
| Interquartile range (IQR) | 2571 |
Descriptive statistics
| Standard deviation | 3051.611876 |
|---|---|
| Coefficient of variation (CV) | 0.7882679213 |
| Kurtosis | 4.8227172 |
| Mean | 3871.287659 |
| Median Absolute Deviation (MAD) | 1133.5 |
| Skewness | 2.135184252 |
| Sum | 42096382 |
| Variance | 9312335.039 |
| Value | Count | Frequency (%) | |
| 2447 | 13 | 0.1% | |
| 2345 | 12 | 0.1% | |
| 2126 | 12 | 0.1% | |
| 2269 | 12 | 0.1% | |
| 1935 | 11 | 0.1% | |
| 2281 | 11 | 0.1% | |
| 1687 | 10 | 0.1% | |
| 2418 | 10 | 0.1% | |
| 2136 | 10 | 0.1% | |
| 2388 | 10 | 0.1% | |
| Other values (5094) | 10763 | 99.0% |
| Value | Count | Frequency (%) | |
| 569 | 1 | < 0.1% | |
| 585 | 1 | < 0.1% | |
| 594 | 1 | < 0.1% | |
| 596 | 2 | < 0.1% | |
| 613 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 18484 | 1 | < 0.1% | |
| 17744 | 1 | < 0.1% | |
| 17634 | 1 | < 0.1% | |
| 17437 | 1 | < 0.1% | |
| 17093 | 1 | < 0.1% |
Total_Trans_Ct
Real number (ℝ≥0)
| Distinct count | 124 |
|---|---|
| Unique (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 56.667279749862054 |
|---|---|
| Minimum | 10 |
| Maximum | 139 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 40 |
| median | 51 |
| Q3 | 73 |
| 95-th percentile | 95 |
| Maximum | 139 |
| Range | 129 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 22.50941987 |
|---|---|
| Coefficient of variation (CV) | 0.397220759 |
| Kurtosis | -0.06220861498 |
| Mean | 56.66727975 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 0.5980179399 |
| Sum | 616200 |
| Variance | 506.6739831 |
| Value | Count | Frequency (%) | |
| 41 | 327 | 3.0% | |
| 42 | 316 | 2.9% | |
| 43 | 312 | 2.9% | |
| 40 | 281 | 2.6% | |
| 39 | 281 | 2.6% | |
| 44 | 271 | 2.5% | |
| 38 | 270 | 2.5% | |
| 45 | 260 | 2.4% | |
| 36 | 234 | 2.2% | |
| 37 | 232 | 2.1% | |
| Other values (114) | 8090 | 74.4% |
| Value | Count | Frequency (%) | |
| 10 | 2 | < 0.1% | |
| 11 | 3 | < 0.1% | |
| 12 | 9 | 0.1% | |
| 13 | 7 | 0.1% | |
| 14 | 21 | 0.2% |
| Value | Count | Frequency (%) | |
| 139 | 1 | < 0.1% | |
| 138 | 1 | < 0.1% | |
| 131 | 5 | < 0.1% | |
| 130 | 2 | < 0.1% | |
| 129 | 4 | < 0.1% |
Total_Ct_Chng_Q4_Q1
Real number (ℝ≥0)
| Distinct count | 5113 |
|---|---|
| Unique (%) | 47.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6475065286655576 |
|---|---|
| Minimum | 0.0 |
| Maximum | 3.25 |
| Zeros | 5 |
| Zeros (%) | < 0.1% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.312098424 |
| Q1 | 0.4968741117 |
| median | 0.6400350781 |
| Q3 | 0.778 |
| 95-th percentile | 1 |
| Maximum | 3.25 |
| Range | 3.25 |
| Interquartile range (IQR) | 0.2811258883 |
Descriptive statistics
| Standard deviation | 0.2302693108 |
|---|---|
| Coefficient of variation (CV) | 0.3556246935 |
| Kurtosis | 8.095188764 |
| Mean | 0.6475065287 |
| Median Absolute Deviation (MAD) | 0.1400350781 |
| Skewness | 1.291232781 |
| Sum | 7040.985993 |
| Variance | 0.0530239555 |
| Value | Count | Frequency (%) | |
| 1 | 113 | 1.0% | |
| 0.667 | 111 | 1.0% | |
| 0.5 | 109 | 1.0% | |
| 0.75 | 97 | 0.9% | |
| 0.6 | 76 | 0.7% | |
| 0.714 | 63 | 0.6% | |
| 0.8 | 62 | 0.6% | |
| 0.833 | 55 | 0.5% | |
| 0.778 | 45 | 0.4% | |
| 0.571 | 38 | 0.3% | |
| Other values (5103) | 10105 | 92.9% |
| Value | Count | Frequency (%) | |
| 0 | 5 | < 0.1% | |
| 0.005846974992 | 1 | < 0.1% | |
| 0.028 | 1 | < 0.1% | |
| 0.03028698807 | 1 | < 0.1% | |
| 0.038 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 3.25 | 1 | < 0.1% | |
| 3 | 2 | < 0.1% | |
| 2.875 | 1 | < 0.1% | |
| 2.75 | 1 | < 0.1% | |
| 2.4 | 1 | < 0.1% |
Avg_Utilization_Ratio
Real number (ℝ≥0)
| Distinct count | 3023 |
|---|---|
| Unique (%) | 27.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22810309851643076 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.999 |
| Zeros | 3872 |
| Zeros (%) | 35.6% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.09029051318 |
| Q3 | 0.416 |
| 95-th percentile | 0.807 |
| Maximum | 0.999 |
| Range | 0.999 |
| Interquartile range (IQR) | 0.416 |
Descriptive statistics
| Standard deviation | 0.2781451934 |
|---|---|
| Coefficient of variation (CV) | 1.21938367 |
| Kurtosis | -0.2593014475 |
| Mean | 0.2281030985 |
| Median Absolute Deviation (MAD) | 0.09029051318 |
| Skewness | 1.029154209 |
| Sum | 2480.393093 |
| Variance | 0.0773647486 |
| Value | Count | Frequency (%) | |
| 0 | 3872 | 35.6% | |
| 0.073 | 51 | 0.5% | |
| 0.061 | 21 | 0.2% | |
| 0.048 | 21 | 0.2% | |
| 0.059 | 21 | 0.2% | |
| 0.057 | 19 | 0.2% | |
| 0.045 | 18 | 0.2% | |
| 0.071 | 18 | 0.2% | |
| 0.056 | 18 | 0.2% | |
| 0.07 | 17 | 0.2% | |
| Other values (3013) | 6798 | 62.5% |
| Value | Count | Frequency (%) | |
| 0 | 3872 | 35.6% | |
| 1.120423815e-06 | 1 | < 0.1% | |
| 4.728281387e-05 | 1 | < 0.1% | |
| 5.308497479e-05 | 1 | < 0.1% | |
| 7.841971162e-05 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.999 | 1 | < 0.1% | |
| 0.995 | 1 | < 0.1% | |
| 0.9944600086 | 1 | < 0.1% | |
| 0.9943447859 | 1 | < 0.1% | |
| 0.994 | 1 | < 0.1% |
Attrition_Flag
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 85.0 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5437 | 50.0% |
| 1 | 5437 | 50.0% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for a linear relationship the steepness of the slope does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
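The calculation described above can be sketched in plain Python. The data are illustrative values, not taken from the dataset, and `pearson_r` is a name chosen here for the example:

```python
import math

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / n)
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / n)
    return cov / (std_x * std_y)

# Illustrative values only.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)

# Shifting and rescaling each variable by a positive factor leaves r unchanged.
r_transformed = pearson_r([10 * v + 3 for v in x], [0.5 * v - 7 for v in y])

print(r, r_transformed)
```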
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic relationships than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
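A minimal sketch of that definition: rank both variables, then compute Pearson's r on the ranks. Tied values receive the average of their positions, which is a common convention assumed here; the helper names are illustrative.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Pearson's r; the 1/n factors cancel, so plain sums are used."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
    var_x = sum((a - mean_x) ** 2 for a in xs)
    var_y = sum((b - mean_y) ** 2 for b in ys)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# Illustrative data: monotonic but clearly nonlinear.
x = [1, 2, 3, 4, 5, 6]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

Because y increases whenever x does, the ranks agree perfectly and ρ reaches its maximum, while r stays below it.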
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
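The pair-counting definition above can be sketched directly. This is the simple form in which tied pairs count as neither concordant nor discordant; the report's own computation may use a tie-adjusted variant. The data and function name are illustrative.

```python
from itertools import combinations

def kendall_tau(xs, ys):
    """(concordant - discordant) / total number of pairs; ties count as neither."""
    pairs = list(combinations(range(len(xs)), 2))
    concordant = discordant = 0
    for i, j in pairs:
        # Same sign of the differences means the pair is ordered the same way.
        product = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# Illustrative data: 10 pairs, of which 8 are concordant and 2 discordant.
x = [1, 2, 3, 4, 5]
y = [1, 3, 2, 5, 4]

tau = kendall_tau(x, y)
print(tau)  # (8 - 2) / 10 = 0.6
```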
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Customer_Age | Gender | Dependent_count | Education_Level | Marital_Status | Income_Category | Card_Category | Months_on_book | Total_Relationship_Count | Months_Inactive_12_mon | Contacts_Count_12_mon | Credit_Limit | Total_Revolving_Bal | Avg_Open_To_Buy | Total_Amt_Chng_Q4_Q1 | Total_Trans_Amt | Total_Trans_Ct | Total_Ct_Chng_Q4_Q1 | Avg_Utilization_Ratio | Attrition_Flag |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 35 | F | 2 | Graduate | Married | Less than $40K | Blue | 36 | 5 | 2 | 4 | 5546.0 | 1829 | 3717.0 | 0.673 | 1770 | 46 | 1.000 | 0.330 | 0 |
| 1 | 57 | F | 1 | Uneducated | Single | Less than $40K | Blue | 46 | 2 | 3 | 3 | 9129.0 | 0 | 9129.0 | 0.733 | 7733 | 81 | 0.884 | 0.000 | 1 |
| 2 | 59 | F | 1 | Graduate | Single | Less than $40K | Blue | 52 | 5 | 3 | 3 | 2636.0 | 1619 | 1017.0 | 0.565 | 4727 | 83 | 0.627 | 0.614 | 0 |
| 3 | 45 | F | 4 | Graduate | Single | Less than $40K | Blue | 33 | 2 | 1 | 3 | 3001.0 | 2055 | 946.0 | 0.838 | 4662 | 70 | 0.628 | 0.685 | 0 |
| 4 | 43 | M | 2 | Doctorate | Single | $120K + | Blue | 34 | 5 | 3 | 4 | 33913.0 | 0 | 33913.0 | 0.901 | 4160 | 76 | 0.900 | 0.000 | 0 |
| 5 | 63 | M | 0 | College | Single | $80K - $120K | Blue | 56 | 6 | 2 | 2 | 9033.0 | 0 | 9033.0 | 0.780 | 4456 | 85 | 0.635 | 0.000 | 0 |
| 6 | 51 | F | 3 | High School | Married | Less than $40K | Blue | 35 | 6 | 1 | 3 | 3025.0 | 1912 | 1113.0 | 0.674 | 4626 | 78 | 0.773 | 0.632 | 0 |
| 7 | 48 | M | 3 | Graduate | Married | $80K - $120K | Blue | 38 | 3 | 3 | 3 | 31501.0 | 0 | 31501.0 | 0.698 | 3742 | 75 | 0.786 | 0.000 | 0 |
| 8 | 37 | M | 2 | High School | Married | $120K + | Blue | 32 | 6 | 1 | 1 | 34516.0 | 0 | 34516.0 | 0.928 | 3660 | 59 | 1.107 | 0.000 | 0 |
| 9 | 46 | M | 2 | Graduate | Divorced | $80K - $120K | Blue | 30 | 3 | 3 | 3 | 2770.0 | 1286 | 1484.0 | 1.068 | 2920 | 74 | 0.721 | 0.464 | 0 |
Last rows
| | Customer_Age | Gender | Dependent_count | Education_Level | Marital_Status | Income_Category | Card_Category | Months_on_book | Total_Relationship_Count | Months_Inactive_12_mon | Contacts_Count_12_mon | Credit_Limit | Total_Revolving_Bal | Avg_Open_To_Buy | Total_Amt_Chng_Q4_Q1 | Total_Trans_Amt | Total_Trans_Ct | Total_Ct_Chng_Q4_Q1 | Avg_Utilization_Ratio | Attrition_Flag |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10864 | 32 | F | 2 | Graduate | Married | Less than $40K | Blue | 36 | 5 | 3 | 3 | 1438.300000 | 0 | 1438.300000 | 0.837858 | 2447 | 40 | 0.582008 | 0.000000 | 1 |
| 10865 | 52 | F | 2 | Uneducated | Married | Less than $40K | Blue | 37 | 6 | 3 | 1 | 3548.475463 | 2517 | 1031.475463 | 0.693004 | 2375 | 41 | 0.469966 | 0.709643 | 1 |
| 10866 | 61 | F | 0 | College | Single | Less than $40K | Blue | 36 | 3 | 3 | 4 | 3808.107115 | 2517 | 1291.107115 | 0.584270 | 2335 | 46 | 0.732586 | 0.660721 | 1 |
| 10867 | 45 | M | 2 | High School | Single | $40K - $60K | Blue | 36 | 4 | 2 | 1 | 15241.134711 | 883 | 14358.065847 | 0.529461 | 2239 | 47 | 0.512811 | 0.057734 | 1 |
| 10868 | 43 | M | 2 | Graduate | Married | $80K - $120K | Blue | 36 | 2 | 2 | 2 | 5004.930220 | 2517 | 2487.930220 | 0.515561 | 1634 | 35 | 0.351238 | 0.502872 | 1 |
| 10869 | 59 | M | 1 | High School | Single | Less than $40K | Blue | 50 | 1 | 4 | 3 | 4957.260741 | 594 | 4362.410205 | 0.839495 | 9930 | 68 | 0.885691 | 0.117689 | 1 |
| 10870 | 55 | F | 0 | Doctorate | Single | $40K - $60K | Blue | 45 | 5 | 3 | 2 | 2308.105503 | 0 | 2308.105503 | 0.607105 | 2238 | 40 | 0.606759 | 0.000000 | 1 |
| 10871 | 54 | M | 1 | Graduate | Married | $80K - $120K | Blue | 43 | 3 | 2 | 3 | 11690.525890 | 265 | 11424.624300 | 0.707986 | 2469 | 48 | 0.463102 | 0.022386 | 1 |
| 10872 | 43 | F | 3 | Uneducated | Married | Less than $40K | Blue | 25 | 3 | 2 | 2 | 3218.320631 | 0 | 3218.320631 | 0.451446 | 2099 | 41 | 0.481320 | 0.000000 | 1 |
| 10873 | 55 | F | 3 | Doctorate | Single | Less than $40K | Blue | 47 | 4 | 4 | 2 | 2422.759093 | 398 | 2024.641275 | 0.630990 | 2418 | 51 | 0.548982 | 0.163856 | 1 |